policy iteration meaning in Chinese

策略迭代法

Examples

  1. The ACO algorithms are fitted into the framework of generalized policy iteration (GPI) in RL based on incomplete information of the Markov state. Furthermore, we show that the pheromone update in the ACS and Ant-Q algorithms is based on MC methods or some formalistic combination of MC methods and TD methods.
    此外在强化学习的理论框架内说明了AS算法是一种基于蒙特卡洛方法的强化学习算法，ACS和Ant-Q算法是一种蒙特卡洛方法与瞬时差分方法在形式上相结合的强化学习算法。

Related Words

  1. iteration
  2. policy
  3. policy management
  4. migration policy
  5. loading policy
  6. coercion policy
  7. named policy
  8. tranche policy
  9. canal policy
  10. immigration policies
  11. policy intellectuals
  12. policy issue
  13. policy legitimation
  14. policy limit

Copyright © 2018 WordTech Co.